Regression models to relate a scalar $Y$ to a functional predictor $X(t)$ arebecoming increasingly common. Work in this area has concentrated on estimatinga coefficient function, $\beta(t)$, with $Y$ related to $X(t)$ through$\int\beta(t)X(t) dt$. Regions where $\beta(t)\ne0$ correspond to places wherethere is a relationship between $X(t)$ and $Y$. Alternatively, points where$\beta(t)=0$ indicate no relationship. Hence, for interpretation purposes, itis desirable for a regression procedure to be capable of producing estimates of$\beta(t)$ that are exactly zero over regions with no apparent relationship andhave simple structures over the remaining regions. Unfortunately, most fittingprocedures result in an estimate for $\beta(t)$ that is rarely exactly zero andhas unnatural wiggles making the curve hard to interpret. In this article weintroduce a new approach which uses variable selection ideas, applied tovarious derivatives of $\beta(t)$, to produce estimates that are bothinterpretable, flexible and accurate. We call our method "Functional LinearRegression That's Interpretable" (FLiRTI) and demonstrate it on simulated andreal-world data sets. In addition, non-asymptotic theoretical bounds on theestimation error are presented. The bounds provide strong theoreticalmotivation for our approach.
展开▼
机译:将标量$ Y $与功能预测变量$ X(t)$相关的回归模型越来越普遍。该领域的工作集中在估计系数函数$ \ beta(t)$,其中$ Y $与$ X(t)$到$ \ int \ beta(t)X(t)dt $相关。 $ \ beta(t)\ ne0 $对应的区域对应于$ X(t)$和$ Y $之间存在关系的地方。或者,$ \ beta(t)= 0 $的点表示没有关系。因此,出于解释的目的,期望回归过程能够产生在没有明显关系的区域上正好为零的估计值,在剩余区域上具有简单的结构。不幸的是,大多数拟合过程导致对\ beta(t)$的估计很少精确为零,并且存在不自然的摆动,使得曲线难以解释。在本文中,我们介绍一种使用变量选择思想的新方法,该思想应用于$ \ beta(t)$的各种导数,以生成可解释,灵活且准确的估计。我们将我们的方法称为“可解释的功能性线性回归”(FLiRTI),并在模拟的和真实的数据集上进行演示。另外,给出了估计误差的非渐近理论界。界限为我们的方法提供了强大的理论动力。
展开▼